Speaker Recognition in an Emotional Environment

نویسندگان

  • Marius Vasile Ghiurcau
  • Corneliu Rusu
  • Jaakko Astola
چکیده

The goal of this paper is to assess the effect of emotional state of a speaker when text-independent speaker identification is performed. Mel-frequency cepstral coefficients are the features of the speech signal used for speaker recognition. For training the speaker models and testing the system, Support Vector Machines are employed. Berlin emotional speech database, which contains 10 different speakers recorded in different emotional situations (happy, angry, fear, bored, sad and neutral) is used. The results show an important influence of the emotional state upon textindependent speaker identification. A possible solution to this issue is finally suggested.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

Speaker Identification in Emotional Environments

The performance of speaker identification is almost perfect in the neutral environment. However, the performance is significantly deteriorated in emotional environments. In this work, three different and separate models have been used, tested and compared to identify speakers in each of the neutral and emotional environments (completely two separate environments). Our emotional environments in ...

متن کامل

Automatic Speech Emotion and Speaker Recognition based on Hybrid GMM and FFBNN

In this paper we present text dependent speaker recognition with an enhancement of detecting the emotion of the speaker prior using the hybrid FFBN and GMM methods. The emotional state of the speaker influences recognition system. Mel-frequency Cepstral Coefficient (MFCC) feature set is used for experimentation. To recognize the emotional state of a speaker Gaussian Mixture Model (GMM) is used ...

متن کامل

Applying pitch-dependent difference detection and modification to emotional speaker recognition

Emotion is an internal source, which can cause the speaker recognition system performance degradation by inducing extra intra-speaker vocal variability. Several enhancements have been applied to speaker recognition system under emotional speech. However, these methods suffer from the limitation of requiring the emotional speech in training or the emotion state of the speaker in testing. This pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011